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O:\SPECS\JPD\SUSSEXl.lwp 

Peptides 

This invention relates to protein structures and to methods of producing those protein 
structures and to protein fibres and other materials and assemblie s produced using those 
protein structures. 

The process of molecular self-assembly is central to all biological systems and is assuming 
increasing importance and application in biotechnology (L. Q. Gu, et al (1999) Nature 398, 
686) and nanotechnology (K. E. Drexler, (1999) TEBTECH 17, 5). The characterization of 
natural biomolecular assemblies motivates and directs the development of model 
self-assembling systems and, in turn, these advance our understanding of biology. For 
proteins at least, the coiled coil is arguably the simplest self-assembling system. Coiled 
coils are protein-folding motifs that direct and cement a wide variety of protein-protein 
interactions (A. Lupas, (1996) Trends Biochem. Sci 21, 375). In structural terms, coiled 
coils are relatively straightforward: they are a-helical bundles with between 2 and 5 strands 
that can be arranged in parallel, antiparallel or mixed topologies. The basic sequence 
features that guide the formation of coiled coils from, peptides are reasonably well 
understood (P. B. Harbury et al (1993) Science 262, 1401; D. N. Woolfson and T. Alber 
(1995) Protein Sci. 4, 1596). For instance, most coiled-coil sequences are dominated by a 
7-residue repeat of hydrophobic (H) and polar (P) residues, (HPPHPPP)„, known as the 
"heptad repeat". When configured into an a-helix this pattern gives an amphipathic 
structure, the hydrophobic face of which directs oligomer-assembly. Furthermore, both the 
number and the direction of chains within a coiled-coil bundle is determined predominantly 
by residues that form or flank the hydrophobic core namely, residues at the first, fourth, 
fifth and seventh positions of the heptad repeat. For instance, coiled coils which form 
dimers (i.e. two-stranded assemblies) usually have A isoleucine or valine residue at the first 
position and a leucine residue at the fourth position. By contrast, coiled coils that form 
trimers (i.e. three-stranded assemblies) often have the same residues (i.e both isoleucine or 
both leucine) at both "H" positions. Finally, hetero-oligomers (that is coiled coils made 
from strands with different amino-acid sequences) may be directed by complementary 
charged interactions that flank the hydrophobic core. For these reasons, there have been a 



number of successful de novo protein designs based on the coiled coil These include some 
ambitious structures that extend the natural repertoire of coiled-coil motifs (S. Nautiyal et 
al (1995) Biochemistry 34, 11645; A. Lombardi et al (1996) Biopolymers 40, 495; D. H. 
Lee et al (1996) Nature 382, 525; P. B. Harbury et al (1998) Science 282, 1462; J. P. 
JSchneider_e^^ 

Other relevant aspects of coiled coil structure are described in W099/1 1774, the disclosure 
of which is incorporated herein by way of reference. 

This understanding of coiled coils, and the resulting protein designs, centres on short 
structures as exemplified by the leucine-zipper motifs (E. K. O'Shea et al (1989) Science 
243, 538; E. K. O'Shea et al (1991) Science 254, 539), which are found in a variety of 
transcription factors. In contrast, most natural coiled coils extend over hundreds of amino 
acids (A. Lupas (1996) Supra; J. Sodek et al (1972) Proc. Natl Acad. Set U.S.A 69, 3800) 
and many assemble further to form thicker, multi-stranded filaments (H. Herrmann and U. 
Aebi (1998) Curr. Opin. Struct. Biol 8, 177). 

With the goal of making elongated structures to improve our understanding of coiled coils, 
and to develop protein-design studies, two 28-residue peptides were initially designed — 
dubbed Self- Assembling Fibre peptides, SAF-pl and SAF-p2 — to fold and form extended 
fibres when mixed. Focusing on the buried, hydrophobic-core positions of the structure, 
rules were incorporated to direct parallel dimer formation and to guard against alternative 
oligomers and topologies (P. B. Harbury et al (1993) supra; D. N. Woolfson and T. Alber 
(1995) supra; L. J. Gonzalez et al (1996) Nature Struct. Biol. 3, 101 1). The building block 
of the design was a staggered heterodimer with overhanging or "sticky" ends. This 
contrasts with and distinguishes it from the natural and designer coiled-coil assemblies that 
have been characterized to date, in which the polypeptide strands align in-register, i.e they 
have blunt or "flush" ends. Complementary core interactions and flanking ion-pairs were 
incorporated into the overhangs to facilitate longitudinal association of the heterodimers 
(Figs. 1&2). This principle of using "sticky ends" is well developed in molecular biology 
for assembling DNA (S. J. Palmer et al (1998) Nucleic Acids Res. 26, 2560), and has been 
used to design intricate DNA crystals (E. Winfree et al (1998) Nature 394, 539). However, 
to our knowledge, our application of sticky end-directed molecular assembly to peptides is 
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new; although we do note that head-to-tail packing of helices has been observed in recently 
solved crystal structures for two designer peptides (N. L. Ogihara et al (1997) Protein Set 
6, 80; G. G. Prive et al (1999) Protein ScL 8, 1400). These were helical peptides that 
crystallised with their helical ends in contact so as to form pseudo-continuous helices in the 
~so1id^taterin^ther words-th^ — - 

According to one aspect of the invention there is provided a protein structure comprising a 
plurality of first peptide monomer units arranged in a first strand and a plurality of second 
peptide monomer units arranged in a second strand, the strands preferably forming a coiled 
coil structure, and in which a first peptide monomer unit in the first strand extends beyond 
a corresponding second peptide monomer unit in the second strand in the direction of the 
strands. The protein structures of the invention have numerous advantages. For example, 
relatively long protein fibres can be formed with little material - 1 (il of a 100 |iM solution 
of the peptide monomers may provide enough material to form 10 m of continuous fibre 50 
nm thick. 

At least one charged amino acid residue of the first peptide monomer unit may be arranged 
to attract an oppositely-charged amino acid residue of the second peptide monomer unit. 
Preferably, the charged amino acid residue is in an end portion of the first peptide 
monomer unit, which extends beyond the corresponding second peptide monomer unit in 
the second strand. At least one strand may consist solely of first or second peptide 
monomer units respectively i.e homogenous strands. Heterologous strands are also 
contemplated. The peptide monomer units may comprise a repeating structural unit. 
Preferably, the repeating structural unit comprises a heptad repeat motif, preferably having 
the pattern: 

abcdefg 

the repeat may include isoleucine or asparagine at position a and leucine at position d. 
Other repeats (e.g hendecads - abcdefghijk) and amino acid compositions may be used (see 
W099/11774). 

Preferably, the heptad repeat comprises oppositely-charged residues at postions e and g 
respectively. The oppositely-charged residues may be, for example, glutamic acid and 



lysine residues or arginine and aspartic acid. The use of synthetic amino acids, such as 
ornithine is also envisaged. 

A protein structure in accordance with the invention may be also stabilised by pairs of 
asparagine residues in the "a" positions provided by corresponding first and second peptide 
monomer units. 

In a preferred protein structure, the first and second peptide monomer units have the 
following sequences: 

a) KI AALKQKI AS LKQE I DALE YENDALEQ (SAF-plC) and 

b) KIRALKAKNAHLKQE I AALEQE I AALEQ (SAF-p2D) respectively; or 

c) KIAALKQKIAALKQE I DALE YENDALEQ (SAF-plA) and 

d) KIRALKWKNAHLKQE I AALEQE I AALEQ ( SAF-p2C) respectively; or 

e) KIAALKQKIASLKQE I DALE YENDALEQ (SAF-plC) and 

f) KIRALKWKNAHLKQE I AALEQE I AALEQ (SAF-p2C) respectively. 

According to another aspect of the invention, there is provided a method of producing 
protein structures, the method comprising providing a mixture of first and second peptide 
monomer units which associate to form a protein structure according to the invention. The 
structure can be derivatised and/or stabilized by cross-linking. 

The invention also provides protein fibres produced by an association of protein structures 
according to the invention. 

According to another aspect of the invention, there is provided a kit for making protein 
structures, the kit comprising first and second peptide monomer units which associate to 
form a protein structure or protein fibres according to the invention. 

The protein structures of the invention may be assembled in two and three dimensional 
arrays. For example, two dimensional mats can be formed which can function, for example 
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as filters. Three dimensional grids can also be formed again for example, for use as filters 
or for organising other associated or conjugated molecules in three dimensions. 

Functional moieties may be appended to the amino acids of the peptide monomer units. 
For example, fluorescent moieties (fluorophores) may b e attached to the coiled coil as 
described in W099/1 1774. The addition of fluorescent moieties may assist visualization of 
the protein structure. Substitution with functional groups at the "f ' position in the heptad 
repeat is especially preferred as that position is on the outside of the helix (see Fig. 1C and 
IE). 

The stability of the protein structures at higher temperatures may be improved by making 
the peptide monomers longer, such that the overlap between corresponding first and second 
monomer unit residues is increased. Increases in monomer length have previously been 
shown to stabilize coiled coil structures. Alternatively, stability can be improved by 
introducing bonding between adjacent peptide monomer units in the same strand. For 
example, Kent (Dawson et al (1994) Science 266: 776) and co-workers have produced 
peptide bonds between adjacent polypeptide units by coupling and subsequent 
rearrangement of a cysteine residue at the N end of one polypeptide unit to a thio-ester 
derivatised C-terminus of another unit. 

Definitions 

The terms used in the specification are to be given the ordinary meaning attributed to them 
by the skilled addressee. The following is given by way of clarification: 

Amino acid. 

This term embraces both naturally-occuring amino acids and synthetic amino acids as well 
as naturally-occuring amino acids which have been modified in some way to alter certain 
properties such as charge. In all cases references to naturally-occurring amino acids may be 
considered to include synthetic amino acids which may be substituted therefor. 



Coiled Coil 



A coiled coil is a peptide/protein sequence with a contiguous pattern of hydrophobic 
residues spaced 3 and 4 residues apart, which assembles (folds) to a multi-meric bundle of 
helices. 

Dimer 

A dimer is a two stranded structure. 
Heterodimer 

A heterodimer is a dimeric structure formed by two different stands. 
Staggered heterodimer 

A staggered heterodimer is a structure in which the two strands assemble to leave 
overlagging ends that are not interacting within the heterodimer. 

Protofibril 

A protofibril is a protein structure assembled longitudinally from staggered heterodimers 
interacting through their overhanging ends. 

Fibre 

A fibre is a structure formed by lateral association of two or more protofibril. 

Protein structures and methods of producing protein structures in accordance with the 
invention will now be described, by way of example only, with reference to the 
accompanying Figures 1 to 4 in which: 

Fig. 1 illustrates the design and the sequences of self-assembling fibre (SAF) peptide 
monomers of the invention. 

Fig. 2 illustrates computer modeling of the designed self-assembling fibre of the invention. 

Fig. 3 illustrates the results of circular dichroism (CD) and linear dichroism (LD) 
experiments on protein structures of the invention. 



Fig. 4 illustrates the assembly of synthetic protein fibres visualized directly by transmission 
electron microscopy and an analysis of fibre width In all panels, the white scale bars 
represent 100 nm. Fig. 4D is a histogram showing the distribution of fibre widths 
determined using TEM for fresh (white bars) and matured (black bars) mixtures of SAF 
pepfide^nt)07rM^ 
from "(*-5) to jc"). 

1) Peptide Design and Synthesis 

Various peptide monomer units were designed as described above. The monomers and 
capping peptides (designed to complement the sticky ends of the monomers so as to 
produce flush, or blunt ends and, so, arrest longitudinal fibre assembly) are set out in Table 
1: 
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Fig. 1 shows (A) A mechanism for self-assembly: complementary charges in "companion" 
peptides direct the formation of staggered, parallel heterodimers; the resulting "sticky" ends 
are also complementary and promote longitudinal association into extended structures. Fig 
1(B) shows the designed amino acid sequences: each peptide comprised canonical hep tad 
repeats (abcdefg) with lie at a and Leu at d to guide the formation of coiled-coil dimers; 
oppositely-charged residues were incorporated at e and g to favour the staggered dimer with 
sticky ends; asparagine residues (which preferentially pairs with each other at a sites 
(Gonzalez L et al (1996) Nature Structural Biology 3, 13: 1011-1018) were included to 
cement the prescribed register further and to favoor the parallel structures. Fig. 1(C) is a 
helical-wheel representation, summarizing the designed sequences in context. The view is 
from the N-terminus with heptad sites labeled a-g and assumes 3.5 residues per helical turn to 
emphasise the heptad repeat. 

The peptides were synthesized on an Applied Biosystems 432A Peptide Synthesizer using 
solid-phase methods and Fmoc chemistry. Peptide samples were purified using 
reversed-phase HPLC and their identities confirmed by MALDI-TOF mass spectrometry. 



Various combinations of peptide monomers and capping peptides were tested as set out in 
Table 2: 
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2) Modeling of Protein Fibre Structure 

A model of the three-dimensional structure of the designed protein fibre resulting from the 
assembly of SAF-pl and SAF-p2 was made from the minimized structure of a model 
coiled-coil 35-mer, (LAALAAA) 5 , which was generated using Crick's Equation and had an 
-ideally-packed interface (G. Offer and R. Sessions, J. Mol Biol 249, 967 (1995)). Copies of 
the 35-mer were superimposed with an overlap of one heptad repeat to extend the structural 
template, and the backbone was rejoined after removal of overlapping segments. Residues in 
the two-stranded template were replaced with the sequences of the SAF peptides, staggered 
relative to each other by two heptad repeats according to the alignment in Fig. IB. The 
structure was soaked in a 5 A layer of water and energy minimized until the average absolute 
derivative of coordinates with respect to energy fell below 0.01 kcal A" 1 . The structure was 
built and visualized using Insight II 97.0 (Molecular Simulations Inc.), and was 
energy-minimized using Discover 2.9.8 (Molecular Simulations Inc.) with the consistent 
valence forcefield. In Fig 2(A) peptides SAF-pl and SAF-p2 (each coloured dark 
grey-to-light grey from the N-terminus) interact through core residues including asparagine 
pairs (coloured mid-grey) to form the two strands of a staggered, parallel, coiled-coil fibre. In 
Fig. 2(B), negatively charged glutamate side chains (coloured light grey) and positively 
charged lysine side chains (coloured black) form complementary charge interactions between 
the SAF peptides. 

3) Circular Dichroism Experiments 

Peptide samples were incubated at 5°C in 10 mM MOPS (3-(N-Morpholino)propanesulfonic 
acid), pH 7. Sample concentrations were determined from their UV absorbance at 280 nm 
(SAF-pl) and 214 nm (SAF-p2), After baseline correction, ellipticities in mdeg were 
converted to molar ellipticities (deg cm 2 dmol-res" 1 ) by normalizing for the concentration of 
peptide bonds. Data were recorded in a cell of 1 mm path length by integrating the signal for 
5s (and Is for the fresh 100 jiM peptide mixture) every nm in the range 205-260 nm. CD 
measurements were made using a JASCO J-715 spectropolarimeter fitted with a Peltier 
temperature controller. 

The CD data shown in Fig. 3 provides spectroscopic evidence for the formation of helical 
structures by the SAF peptides. Fig. 3(A) shows circular dichroism (CD) spectra at 10 p.M 
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for: SAF-pl (- — ), SAF-p2 (- -), the average of these spectra ( ), and the experimental SAF 

peptide mixture (O). Fig 3(B) shows CD spectra at 100 (iM - the key is the same as for Fig 
3(A), but with the additional spectrum (•) being for the SAF peptide mixture after 
"maturation" for 1 h. 



Consistent with our design, neither SAF-pl nor SAF-p2 was highly structured in aqueous 
solution at pH 7 and 5 °C (Fig. 3). However, when mixed in equal proportions the circular 
dichroism (CD) spectrum changed and, moreover, was markedly different from the 
theoretical spectrum generated by averaging the spectra for the isolated peptides. In 
particular, the spectrum for the mixture had intense minima at 208 and 222 nm consistent 
with the formation of a-helical structure, but these features were not as pronounced in the 
spectra of the individual peptides. This was clear evidence that the two peptides interacted to 
form an a-helical structure as designed. Furthermore, and as expected for a multimerization 
event, the magnitude of these spectral changes depended on peptide concentration; a SAF 
mixture with 10 of each peptide, did show a weak signal indicative of some a-helical 
structure, however, a 100 (iM mixture gave a much stronger signal (Figs. 3A&B). 

The shape and intensity of spectra from 100 jiM mixtures of the SAF peptides also changed 
with time (Fig. 3B). Spectra recorded immediately after mixing a "fresh" sample displayed 
some a-helical structure. After incubation of the mixture for 1 hour at 5 °C ("maturation"), 
however, the signal at 222 nm was more intense, and indicated approximately 75 % a-helix, 
consistent with substantial coiled-coil formation. 

Maturation of 100 \iM SAF peptide mixtures was also accompanied by slight clouding of the 
samples. Scattering effects from such samples can lead to attenuation and distortion of CD 
spectra (D. Mao and B. A. Wallace, (1984) Biochemistry 23, 2667). However, we could 
disregard this possibility because altering the distance between the sample and the detector in 
the CD instrument did not affect the shape or the intensity of the spectrum. Furthermore, we 
established that the majority of the CD signal from the mixtures derived from the suspended 
material: a supernatant without the suspended material, which was recovered by 
centrifugation of a matured 100 jiM SAF mixture, gave only a weak CD signal similar to the 
10 (iM mixture. 
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Thus, the CD data were wholly consistent with the desired a-helical SAF design and, 
moreover, indicated the formation of large assemblies. 

4) Linear Dichroism Experiments 

-Iinear-dichroism^LD-)^pectmiLcnp^ was_us_ed_tQ^t£St_if_elang ated structures were be ing 
formed as designed. Long polymers such as DNA molecules can be oriented by shear flow. 
This effect can be monitored by LD spectroscopy provided that chromophores also become 
aligned by the flow (M. Bloemendal (1994) Chem. Soc. Rev, 23, 265; A. Rodger and B. 
Norden (1997) Oxford Chemistry Masters (Oxford University Press, Oxford), vol. 1). 

Peptide samples were prepared for LD as for CD. LD data were collected on samples 
spinning in a couette flow cell by integrating the signal for 2 s every nm in the range 210-320 
nm, using a JASCO J-715 spectropolarimeter. After baseline correction, absorbance was 
converted to molar extinction coefficient (1 mol-res" 1 cm" 1 ) by normalizing for the 
concentration of peptide bonds. A linear correction for a sloping baseline was made to the 
data from the 100 jiM SAF peptide mixture. 

The results are depicted in Fig. 3D, which shows linear dichroism (LD) spectra for: 20 fiM 

tropomyosin ( ), the SAF peptide mixture at 10 \xM ( — ), and the SAF peptide mixture at 

100 in the absence (•) and presence (o) of 0.5 M KF. 

For instance, we found that tropomyosin, which forms a dimeric coiled coil approximately 42 
nm in length, could be aligned to give a LD signal (Fig. 3D). In contrast and consistent with 
our design and the CD data, experiments with a 10 jiM SAF mixture, (Fig. 3D), and for the 
individual peptides at 100 )iM (data not shown), LD signals were not detected. However, a 
matured 100 SAF peptide mixture gave a strong absorbance from the peptide backbone 
(210-240 nm) and some signal in the aromatic region (260-290 nm) during flow orientation 
(Fig. 3D). As only long structures are aligned by this technique, the data demonstrated that 
long fibres at least 500 nm in length were present in solutions of the matured 100 |uM SAF 
peptide mixtures. 



5) Electron Microscopy 



To confirm fibre assembly, we used electron microscopy to visualize structures in the peptide 
preparations directly. For TEM experiments, peptide samples were incubated for 1 h at 5 °C 
in filtered 10 mM MOPS, pH 7. A drop of peptide solution was applied to a carbon-coated 
copper specimen grid (Agar Scientific Ltd, Stansted, UK), and dried with filter paper before 
negative staining with 0.5% aqueous uranyl acetate and then dried at 5 °C. A "fresh" SAF 
peptide mixture was prepared by mixing preincubated solutions of the individual peptides at 
200 |iM directly on the specimen grid, before drying and negative staining as described. 
Grids were examined in a Hitachi 7100 TEM at 100 kV and digital images were acquired 
with a (800 x 1200 pixel) charge-coupled device camera (Digital Pixel Co. Ltd., Brighton, 
UK) and analyzed (Kinetic Imaging Ltd., Liverpool, UK). 

For scanning electron microscopy (SEM) experiments, negatively-stained specimen grids 
were sputter-coated with gold and examined in a Leo Stereoscan 420 SEM at 20 kV and with 
a probe current of 10 pA. 

No structures were visible up to 100 000 times magnification by transmission electron 
microscopy (TEM) for either the 10 ^M SAF mixture, or for the individual peptides at 100 
|uM concentration (data not shown). However, TEM of a 100 fiM SAF mixture at 50 000 
times magnification revealed time-dependent formation of long fibrous structures, consistent 
with the CD and LD data. Fresh mixtures showed large numbers of extended fibres of 
various widths. The majority of these had a diameter of about 20 nm (Figs. 4A (a fresh 
mixture at 100 jiM) & Fig 4D); finer fibres were present, but their widths could not be 
measured reliably. Images recorded for the matured mixtures showed fewer fibres, but these 
were more distinct and thicker than those observed in the fresh mixture (Fig. 4B&D). 
Scanning electron microscopy (SEM) of a matured mixture showed no evidence for fibre 
branching. Rather, the fibres were simply intertwined as if layered on top of each other (Fig. 
4C). It was not possible to follow the full length of fibres due to intertwining, but they were 
at least several hundred microns in length. Although the density of fibres varied across the 
surface of the EM grid, for the matured samples at least, their diameters were quite uniform 
with a mean width of 43.3 (SD = 9.3) nm (Fig. 4D). As the original design was for a 
longitudinally extended, but otherwise two-stranded coiled coil the average diameter that we 
might have expected was about 3 nm. Therefore, the EM data suggested that the designed 
two-stranded coiled-coil fibres associates laterally into higher order assemblies. 
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6) Effect of Potassium Fluoride on Protein Fibre Assembly 

Molecular modeling of the SAF sequences into an extended two-stranded coiled coil also 
highlighted potential complementary charge interactions on the surface of the protofibrils, 
Figs 1&2. In accord with this, experimentally it was found that moderate concentrations of 
salt inhibited protofibril and thick fibre assembly. First, CD spectra recorded for both the 

4ndwidual^eptides^nd^^ 

fluoride showed reduced helical CD signals and there was no evidence of "maturing" in the 
mixed samples (Fig. 3C). Second, the LD signal described previously for the matured 100 
|iM SAF peptide mixture was also lost when the experiment was repeated in the presence of 
salt (Fig. 3D). Finally, TEM images of a 100 nM SAF mixture also demonstrated that fibres 
were not formed in the presence of 0.5 M KF (Fig. 4E). Fig. 4E shows the results of TEM of 
a matured SAF peptide mixture at 100 |iM incubated in the presence of 0.5 M KF. 

The inventors did not knowingly design any features into the SAF peptides to foster further 
association of the two-stranded coiled coils. The observation of thick fibres in SAF peptide 
preparations, therefore, raised the question: what interactions guided and stabilized these 
higher-order assemblies? The inventors therefore propose that features inherent in repeating 
structures of the type that they designed will naturally promote such fibre assembly 
(fibrillogenesis). 

Consider a protofibril as depicted in Fig. IB and 2 A. Any sequence feature presented on its 
surface by either, or both of the constituent peptides will be repeated at regular intervals along 
the protofibril The repeat length will be equal to the length of the peptides (for SAF-pl and 
SAF-p2 this was 28 residues, or about 4.2 nm). Furthermore, the motif will spiral around the 
protofibril tracking the superhelix of the coiled coil, which has a pitch of about 15 nm for a 
contiguous, heptad-based, dimeric structure. In this scenario, protofibril-protofibril 
interactions may be promoted if another sequence motif complementary to the first is present 
in the potential partner. This is because the pitches of the complementary motifs on each 
protofibril will match precisely. Thus, once initiated, lateral association of protofibrils — 
that is, fibrillogenesis — will be cemented by many regularly spaced interactions as in a 
crystal. As a result, the complementary interactions need only be weak as the stability of the 
protofibril-protofibril interaction rests on an avidity effect rather than a small number of 
strong interactions. Provided that the components of the assembly can make more than one 
type of complementary surface very extensive molecular assemblies may result. 
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The inventors used electrostatic interactions both to direct heterodimer formation, and to 
promote elongation of the protofibrils (Figs. 1 and 2). These features would also create 
periodic and alternating patches of charge in the protofibrils provided they are regular as 
envisaged (Fig. IB and 2B). These charged patches could guide and stabilize the higher order 
assemblies. Indeed, similar features have been noted in s everal natural fibrous proteins and 
have been implicated in the assembly of multi-protein filaments (J. J. Meng et al (1994) Biol 
Chem. 269, 18679; A. D. McLachlan and M. Stewart (1976) Mo I. Biol 103, 271), and small 
synthetic peptide systems (S. G. Zhang et al (1993) Proc. Natl. Sc. USA 90, 3334). The 
experiments with salt (KF) described above suggest that salt-bridges (electrostatic interaction) 
may be at least in part the cause of fibrillogenesis. 
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Claims 

1 . A protein structure comprising a plurality of first peptide monomer units arranged in a 
first strand and a plurality of second peptide monomer units arranged in a second strand 
in which a first peptide monomer unit in the first strand extends beyond a 
corresponding second peptide monomer unit in the second strand in the direction of the 
strands. 

2. A protein structure according to claim 1 in which the strands together form a coiled coil 
structure. 

3. A protein structure according to claim 1 or 2 in which at least one charged amino acid 
residue of a first peptide monomer unit is arranged to attract an oppositely-charged 
amino acid residue of a second peptide monomer unit. 

4. A protein structure according to claim 3 in which the charged amino acid residue is in 
an end portion of the first peptide monomer unit which extends beyond the 
corresponding second peptide monomer unit in the second strand. 

5. A polypeptide structure according to any preceding claim in which at least one strand 
consists solely of first or second peptide monomer units respectively. 

6. A protein structure according to any preceding claim in which the peptide monomer 
units comprise a repeating structural unit. 

7. A protein structure according to claim 6 in which the repeating structural unit 
comprises a heptad repeat motif (abcdefg). 

8. A protein structure according to claim 6 in which the repeating structural unit 
comprises a hendecad repeat motif (abcdefghijk) 

9. A protein structure according to claim 6 having isoleucine or asparagine at position a 
and leucine at position d. 

10. A protein structure according to claim 6 having valine or leucine at positions a and d 
respectively. 

11. A protein structure according to any one of claims 7 to 10 having oppositely-charged or 
otherwise complementary residues at positions g and e of respective monomer units. 

12. A protein structure according to claim 11 in which the oppositely-charged residues are 
glutamic acid and lysine residues or asparagine and aspartic acid, or synthetic 
derivatives. 
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13. A protein structure according to any preceding claim in which the structure is stabilised 
by pairs of asparagine, arginine, lysine or other complementary residues provided by 
corresponding first and second peptide monomer units. 

44^-protein-stocture-accor^ 

peptide monomer units have the sequence: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl) and 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2) respectively; or 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA) and 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ ( SAF-p2C) respectively; or 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-plC)and 

f) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C) respectively. 

15. A peptide monomer unit for use in preparing a protein structure the peptide monomer 
unit having an amino acid sequence selected from: 

a) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl); 

b) KIRALKAKNAHLKQEIAALEQEIAALEQ (SAF-p2); 

c) KIAALKQKIAALKQEIDALEYENDALEQ (SAF-plA); 

d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C); 

e) KIAALKQKIASLKQEIDALEYENDALEQ (SAF-pl C) ; and 
d) KIRALKWKNAHLKQEIAALEQEIAALEQ (SAF-p2C). 
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16. A method of producing protein structures, the method comprising providing a mixture 
of first and second peptide monomer units which associate to form a protein structure 
according to any preceding claim. 

17. A method according to claim 16 in which the protein structure is derivatised. 

18. A method according to claim 16 or 17 in which the protein structure is stabilised by 
cross-linking. 

19. Protein fibres produced by an association of protein structures according to any one of 
claims 1 to 3 or a method according to claim 16, 17 or 18. 

20. A kit for making protein structures, the kit comprising first and second peptide 
monomer units which associate to form a protein structure according to any one of 
claims 1 to 13 or protein fibres according to claim 19. 

21. A two dimensional matt comprising a protein structure according to any one of claims 1 
to 13 or protein fibres according to claim 19. 

22. A three dimensional grid comprising a protein structure according to any one of claims 
1 to 13 or protein fibres according to claim 19. 
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Protein fibre formation 



This invention relates to protein fibre formation and in particular to methods of producing 
protein fibres to form a protein structure comprising a plurality of first polypeptide units 
arranged in a first polypeptide strand and a plurality of second polypeptide units arranged in 
a second polypeptide strand, the strands preferably forming a coiled coil structure, and~ln~ 
which a first polypeptide unit in the first strand extends beyond a corresponding second 
polypeptide unit in the second strand in the direction of the strands. 
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